1 Fast detection of communication patterns in distributed executions
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced
Studies on Collaborative research

Publisher: IBM Press 

Full text available: pdf(4.21 MB) Additional Information: full citation, abstract, references, index terms

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ...



Extending OLAP querying to external object databases 

Torben Bach Pedersen, Arie Shoshani, Junmin Gu, Christian S. Jensen 

November 2000 Proceedings of the ninth international conference on Information and 
knowledge management 

Publisher: ACM Press 

Full text available: pdf(168.32 KB) Additional Information: full citation, references, citings, index terms



Hierarchies and relative operators in the OLAP environment 

Elaheh Pourabbas, Maurizio Rafanelli 

March 2000 ACM SIGMOD Record, Volume 29 Issue 1

Publisher: ACM Press 

Full text available: pdf(585.68 KB) Additional Information: full citation, abstract, citings, index terms

In the last few years, numerous proposals for modelling and querying Multidimensional 
Databases (MDDB) are proposed. A rigorous classification of the different types of 
hierarchies is still an open problem. In this paper we propose and discuss some different 
types of hierarchies within a single dimension of a cube. These hierarchies divide in 
different levels of aggregation a single dimension. Depending on them, we discuss the 
characterization of some OLAP operators that refer to hierarchie ... 



OLAP and statistical databases: similarities and differences
Arie Shoshani
May 1997 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on
Principles of database systems
Publisher: ACM Press 

Full text available: pdf(1.66 MB) Additional Information: full citation, references, citings, index terms



An overview of data warehousing and OLAP technology
Surajit Chaudhuri, Umeshwar Dayal
March 1997 ACM SIGMOD Record, Volume 26 Issue 1

Publisher: ACM Press 

Full text available: pdf(101.60 KB) Additional Information: full citation, abstract, citings, index terms

Data warehousing and on-line analytical processing (OLAP) are essential elements of 
decision support, which has increasingly become a focus of the database industry. Many 
commercial products and services are now available, and all of the principal database 
management system vendors now have offerings in these areas. Decision support places 
some rather different requirements on database technology compared to traditional
on-line transaction processing applications. This paper provides an overview ...

6 Querying OLAP databases: Modeling, querying and reasoning about OLAP
databases: a functional approach
Ken Q. Pu

November 2005 Proceedings of the 8th ACM international workshop on Data 
warehousing and OLAP DOLAP '05 

Publisher: ACM Press 

Full text available: pdf(257.43 KB) Additional Information: full citation, abstract, references, index terms

We propose a new functional framework for modeling, querying and reasoning about 
OLAP databases. The framework represents data (data cubes and dimensional 
hierarchies) and querying constructs as first-order and second-order functional symbols 
respectively. A polymorphic attribute-based type system is used to annotate the 
functional symbols with proper type information. Furthermore, semantic knowledge about 
the functional symbols, such as the properties of dimensional hierarchical structures- 
and ... 

Keywords: OLAP, query-language, summarizability

OLAP: An analysis of additivity in OLAP systems
John Horner, Il-Yeol Song, Peter P. Chen 

November 2004 Proceedings of the 7th ACM international workshop on Data 
warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(322.41 KB) Additional Information: full citation, abstract, references, index terms

Accurate summary data is of paramount concern in data warehouse systems; however, 
there have been few attempts to completely characterize the ability to summarize 
measures. The sum operator is the typical aggregate operator for summarizing the large 
amount of data in these systems. We look to uncover and characterize potentially 
inaccurate summaries resulting from aggregating measures using the sum operator. We 
discuss the effect of classification hierarchies, and non-, semi-, and fully- addi ... 

Keywords: OLAP, additivity, data warehouse, metadata, summarization

database

Zina Ben-Miled, Yang Liu, Michael Bern, Robert Jones, Robert Oppelt, Samuel Milosevich, 
Dave Powers, Omran Bukhres 

November 2000 Proceedings of the 2000 ACM/IEEE conference on Supercomputing 
(CDROM) 

Publisher: IEEE Computer Society 

Full text available: pdf(252.19 KB), Publisher Site Additional Information: full citation, abstract, references, index terms

An explosion in the amount of data generated through chemical and biological 
experimentation has been observed in recent years. This rapid proliferation of vast 
amounts of data has led to a set of cheminformatics and bioinformatics applications that 
manipulate dynamic, heterogeneous and massive data. An example of such applications in 
the pharmaceutical industry is the computational process involved in the early discovery 
of lead drug candidates for a given target disease. This computationa ... 



Keywords: cheminformatics, bioinformatics, multithreading, SMP, databases



Searching for dependencies at multiple abstraction levels 
Toon Calders, Raymond T. Ng, Jef Wijsen 

September 2002 ACM Transactions on Database Systems (TODS), Volume 27 issue 3 
Publisher: ACM Press 

Full text available: pdf(411.24 KB) Additional Information: full citation, abstract, references, citings, index terms

The notion of roll-up dependency (RUD) extends functional dependencies with 
generalization hierarchies. RUDs can be applied in OLAP and database design. The 
problem of discovering RUDs in large databases is at the center of this paper. An 
algorithm is provided that relies on a number of theoretical results. The algorithm has 
been implemented; results on two real-life datasets are given. The extension of functional 
dependency (FD) with roll-ups turns out to capture meaningful rules that are outsi ... 

Keywords: Data mining, functional dependencies, knowledge discovery, online analytical 
processing 



10 GPGPU: general purpose computation on graphics hardware

David Luebke, Mark Harris, Jens Kruger, Tim Purcell, Naga Govindaraju, Ian Buck, Cliff
Woolley, Aaron Lefohn

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes GRAPH 
'04 

Publisher: ACM Press 

Full text available: pdf(63.03 MB) Additional Information: full citation, abstract

The graphics processor (GPU) on today's commodity video cards has evolved into an 
extremely powerful and flexible processor. The latest graphics architectures provide 
tremendous memory bandwidth and computational horsepower, with fully programmable 
vertex and pixel processing units that support vector operations up to full IEEE floating 
point precision. High level languages have emerged for graphics hardware, making this 
computational power accessible. Architecturally, GPUs are highly parallel s ... 



11 Progressive skyline computation in database systems

Dimitris Papadias, Yufei Tao, Greg Fu, Bernhard Seeger
March 2005 ACM Transactions on Database Systems (TODS), Volume 30 Issue 1

Publisher: ACM Press

Full text available: pdf(913.10 KB) Additional Information: full citation, abstract, references, index terms

The skyline of a d-dimensional dataset contains the points that are not dominated by any
other point on all dimensions. Skyline computation has recently received considerable 
attention in the database community, especially for progressive methods that can quickly 
return the initial results without reading the entire database. All the existing algorithms, 
however, have some serious shortcomings which limit their applicability in practice. In 
this article we develop branch-and-bound skyline ... 

Keywords: Skyline query, branch-and-bound algorithms, multidimensional access 
methods 



12 Research session: OLAP: OLAP over uncertain and imprecise data 

Doug Burdick, Prasad M. Deshpande, T. S. Jayram, Raghu Ramakrishnan, Shivakumar 
Vaithyanathan 

August 2005 Proceedings of the 31st international conference on Very large data 
bases VLDB '05 

Publisher: VLDB Endowment 

Full text available: pdf(434.01 KB) Additional Information: full citation, abstract, references, index terms

We extend the OLAP data model to represent data ambiguity, specifically imprecision and 
uncertainty, and introduce an allocation-based approach to the semantics of aggregation
queries over such data. We identify three natural query properties and use them to shed 
light on alternative query semantics. While there is much work on representing and 
querying ambiguous data, to our knowledge this is the first paper to handle both 
imprecision and uncertainty in an OLAP setting. 

13 Database theory, technology and applications (DTTA): Creation and management of
versions in multiversion data warehouse

Bartosz Bębel, Johann Eder, Christian Koncilia, Tadeusz Morzy, Robert Wrembel
March 2004 Proceedings of the 2004 ACM symposium on Applied computing 
Publisher: ACM Press 

Full text available: pdf(516.99 KB) Additional Information: full citation, abstract, references, citings, index terms

A data warehouse (DW) provides an information for analytical processing, decision 
making, and data mining tools. On the one hand, the structure and content of a data 
warehouse reflects a real world, i.e. data stored in a DW come from real production 
systems. On the other hand, a DW and its tools may be used for predicting trends and 
simulating a virtual business scenarios. This activity is often called the what-if analysis. 
Traditional DW systems have static structure of their schemas and relati ... 

Keywords: data warehouse, integrity constraints, versioning 



14 A case for dynamic view management 
Yannis Kotidis, Nick Roussopoulos

December 2001 ACM Transactions on Database Systems (TODS), volume 26 issue 4
Publisher: ACM Press

Full text available: pdf(892.57 KB) Additional Information: full citation, abstract, references, citings, index terms, review

Materialized aggregate views represent a set of redundant entities in a data warehouse 
that are frequently used to accelerate On-Line Analytical Processing (OLAP). Due to the 
complex structure of the data warehouse and the different profiles of the users who 
submit queries, there is need for tools that will automate and ease the view selection and
management processes. In this article we present DynaMat, a system that manages
dynamic collections of materialized aggregate views in a data warehous ... 

Keywords: Data cube, OLAP, data warehousing, materialized views 



15 Compressed data cubes for OLAP aggregate query approximation on continuous 
dimensions 

Jayavel Shanmugasundaram, Usama Fayyad, P. S. Bradley 

August 1999 Proceedings of the fifth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: pdf(1.12 MB) Additional Information: full citation, references, citings, index terms



Keywords: OLAP, approximate query answering, clustering, data cubes, data mining, 
density estimation 



16 Querying OLAP databases: A personalization framework for OLAP queries
Ladjel Bellatreche, Arnaud Giacometti, Patrick Marcel, Hassina Mouloudi, Dominique Laurent
November 2005 Proceedings of the 8th ACM international workshop on Data
warehousing and OLAP DOLAP '05
Publisher: ACM Press 

Full text available: pdf(227.64 KB) Additional Information: full citation, abstract, references, index terms

OLAP users heavily rely on visualization of query answers for their interactive analysis of 
massive amounts of data. Very often, these answers cannot be visualized entirely and the 
user has to navigate through them to find relevant facts. In this paper, we propose a 
framework for personalizing OLAP queries. In this framework, the user is asked to give his 
(her) preferences and a visualization constraint, that can be for instance the limitations 
imposed by the device used to display ... 

Keywords: OLAP, multidimensional databases, personalization, preferences, visualization 



17 SchemaSQL: An extension to SQL for multidatabase interoperability
Laks V. S. Lakshmanan, Fereidoon Sadri, Subbu N. Subramanian

December 2001 ACM Transactions on Database Systems (TODS), volume 26 issue 4 

Publisher: ACM Press 

Full text available: pdf(435.89 KB) Additional Information: full citation, abstract, references, citings, index terms, review

We provide a principled extension of SQL, called SchemaSQL, that offers the capability of 
uniform manipulation of data and schema in relational multidatabase systems. We 
develop a precise syntax and semantics of SchemaSQL in a manner that extends 
traditional SQL syntax and semantics, and demonstrate the following. (1) SchemaSQL 
retains the flavor of SQL while supporting querying of both data and schema. (2) It can be 
used to transform data in a database in a structure substa ... 

Keywords: Information integration, SchemaSQL, multidatabase systems, restructuring 
views, schematic heterogeneity 

18 NSF workshop on industrial/academic cooperation in database systems
Mike Carey, Len Seligman

March 1999 ACM SIGMOD Record, Volume 28 Issue 1 
Publisher: ACM Press 

Full text available: pdf(1.96 MB) Additional Information: full citation, index terms



19 Data warehousing and OLAP: A fast filtering scheme for large database cleansing 
Sam Y. Sung, Zhao Li, Peng Sun 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Publisher: ACM Press 

Full text available: pdf(159.98 KB) Additional Information: full citation, abstract, references, index terms

Existing data cleansing methods are costly and will take very long time to cleanse large 
databases. Since large databases are common nowadays, it is necessary to reduce the 
cleansing time. Data cleansing consists of two main components, detection method and 
comparison method. In this paper, we first propose a simple and fast comparison method, 
TI-Similarity, which reduces the time for each comparison. Based on TI-Similarity, we
propose a new detection method, RAR, to further reduce ... 

Keywords: data cleansing, duplicate elimination, filtering scheme, similarity 



20 Research session 5: data mining / transaction management: Multi-structural
databases

Ronald Fagin, R. Guha, Ravi Kumar, Jasmine Novak, D. Sivakumar, Andrew Tomkins
June 2005 Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART
symposium on Principles of database systems
Publisher: ACM Press 

Full text available: pdf(313.14 KB) Additional Information: full citation, abstract, references

We introduce the Multi-Structural Database, a new data framework to support efficient 
analysis of large, complex data sets. An instance of the model consists of a set of data 
objects, together with a schema that specifies segmentations of the set of data objects 
according to multiple distinct criteria (e.g., into a taxonomy based on a hierarchical 
attribute). Within this model, we develop a rich set of analytical operations and design 
highly efficient algorithms for these opera ...

1 A multidimensional modeling approach for OLAP within the framework of the
relational model based on quotient relations
O. Mangisengi, A. M. Tjoa 

November 1998 Proceedings of the 1st ACM international workshop on Data 
warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(645.72 KB) Additional Information: full citation, references, index terms



2 OLAP and statistical databases: similarities and differences

Arie Shoshani

May 1997 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on
Principles of database systems
Publisher: ACM Press

Full text available: pdf(1.66 MB) Additional Information: full citation, references, citings, index terms



3 Data warehousing and OLAP: Semantic-based delivery of OLAP summary tables in
wireless environments

Mohamed A. Sharaf, Panos K. Chrysanthis

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Publisher: ACM Press 

Full text available: pdf(251.10 KB) Additional Information: full citation, abstract, references, citings, index terms

With the rapid growth in mobile and wireless technologies and the availability, 
pervasiveness and cost effectiveness of wireless networks, mobile computers are quickly 
becoming the normal front-end devices for accessing enterprise data. In this paper, we 
are addressing the issue of efficient delivery of business decision support data in the form 
of summary tables to mobile clients equipped with OLAP front-end tools. Towards this, we 
propose a new on-demand scheduling algorithm, called SBS ... 

Keywords: broadcast pull, broadcast scheduling, mobile computing

4 Tables as a paradigm for querying and restructuring (extended abstract)

Marc Gyssens, Laks V. S. Lakshmanan, Iyer N. Subramanian

June 1996 Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on
Principles of database systems
Publisher: ACM Press 

Full text available: pdf(1.10 MB) Additional Information: full citation, references, citings, index terms



5 Industrial session: data warehousing and data mining: Bridging the gap between 
OLAP and SQL 

Jens-Peter Dittrich, Donald Kossmann, Alexander Kreutz 

August 2005 Proceedings of the 31st international conference on Very large data 
bases VLDB '05 

Publisher: VLDB Endowment 

Full text available: pdf(409.18 KB) Additional Information: full citation, abstract, references, index terms

In the last ten years, database vendors have invested heavily in order to extend their 
products with new features for decision support. Examples of functionality that has been 
added are top N [2], ranking [13, 7], spreadsheet computations [19], grouping sets [14], 
data cube [9], and moving sums [15] in order to name just a few. Unfortunately, many 
modern OLAP systems do not use that functionality or replicate a great deal of it in 
addition to other database-related functionality. In fact, the ga ... 

6 An overview of data warehousing and OLAP technology 
Surajit Chaudhuri, Umeshwar Dayal

March 1997 ACM SIGMOD Record, Volume 26 Issue 1 
Publisher: ACM Press 

Full text available: pdf(101.60 KB) Additional Information: full citation, abstract, citings, index terms

Data warehousing and on-line analytical processing (OLAP) are essential elements of 
decision support, which has increasingly become a focus of the database industry. Many 
commercial products and services are now available, and all of the principal database 
management system vendors now have offerings in these areas. Decision support places 
some rather different requirements on database technology compared to traditional
on-line transaction processing applications. This paper provides an overview ...

7 A powerful and SQL-compatible data model and query language for OLAP
Dennis Pedersen, Karsten Riis, Torben Bach Pedersen 

January 2002 Australian Computer Science Communications, Proceedings of the
thirteenth Australasian conference on Database technologies - Volume
5 CRPITS '02, Volume 24 Issue 2

Publisher: Australian Computer Society, Inc. , IEEE Computer Society Press 

Full text available: pdf(1.12 MB) Additional Information: full citation, abstract, references, citings, index terms

In this paper we present the SQLM OLAP data model, formal algebra, and query language
that, unlike current OLAP data models and languages, are both powerful, meaning that 
they support irregular dimension hierarchies, automatic aggregation of data, and correct 
aggregation of data, and SQL-compatible, allowing seamless integration with relational 
technology. We also consider the requirements to the data model posed by integration of 
OLAP data with external XML data. ... 

Keywords: OLAP, data integration, data models, multidimensional databases, query 
languages

8 Heuristic optimization of OLAP queries in multidimensionally hierarchically clustered
databases

Dimitri Theodoratos, Aris Tsois

November 2001 Proceedings of the 4th ACM international workshop on Data
warehousing and OLAP
Publisher: ACM Press 

Full text available: pdf(1.44 MB) Additional Information: full citation, abstract, citings, index terms

On-line analytical processing (OLAP) is a technology that encompasses applications 
requiring a multidimensional and hierarchical view of data. OLAP applications often require 
fast response time to complex grouping/aggregation queries on enormous quantities of 
data. Commercial relational database management systems use mainly multiple
one-dimensional indexes to process OLAP queries that restrict multiple dimensions. However,
in many cases, multidimensional access methods outperform one-dimensiona ... 

9 A survey of logical models for OLAP databases
Panos Vassiliadis, Timos Sellis 

December 1999 ACM SIGMOD Record, volume 28 issue 4 
Publisher: ACM Press 

Full text available: pdf(604.36 KB) Additional Information: full citation, abstract, citings, index terms

In this paper, we present different proposals for multidimensional data cubes, which are 
the basic logical model for OLAP applications. We have grouped the work in the field in 
two categories: commercial tools (presented along with terminology and standards) and 
academic efforts. We further divide the academic efforts in two subcategories: the 
relational model extensions and the cube-oriented approaches. Finally, we attempt a 
comparative analysis of the various efforts. 

10 Extending complex ad-hoc OLAP
Theodore Johnson, Damianos Chatziantoniou 

November 1999 Proceedings of the eighth international conference on Information
and knowledge management 
Publisher: ACM Press 

Full text available: pdf(1.36 MB) Additional Information: full citation, abstract, references, citings, index terms

Large scale data analysis and mining activities require sophisticated information extraction 
queries. Many queries require complex aggregation, and many of these aggregates are 
non-distributive. Conventional solutions to this problem involve defining User Defined 
Aggregate Functions (UDAFs). However, the use of UDAFs entails several problems. 
Defining a new UDAF can be a significant burden for the user, and optimizing queries 
involving UDAFs is difficult because of the "black box" ...

11 Capturing summarizability with integrity constraints in OLAP
Carlos A. Hurtado, Claudio Gutierrez, Alberto O. Mendelzon 

September 2005 ACM Transactions on Database Systems (TODS), Volume 30 issue 3 
Publisher: ACM Press 

Full text available: pdf(710.79 KB) Additional Information: full citation, abstract, references, index terms

In multidimensional data models intended for online analytic processing (OLAP), data are 
viewed as points in a multidimensional space. Each dimension has structure, described by 
a directed graph of categories, a set of members for each category, and a child/parent 
relation between members. An important application of this structure is to use it to infer 
summarizability, that is, whether an aggregate view defined for some category can be 
correctly derived from a set of precomputed views defined f ... 

Keywords: OLAP, data warehousing, integrity constraints, query-optimization, 
summarizability

12 OLAP, relational, and multidimensional database systems

George Colliat
September 1996 ACM SIGMOD Record, Volume 25 issue 3

Publisher: ACM Press 

Full text available: pdf(355.59 KB) Additional Information: full citation, citings, index terms



13 Achieving scalability in OLAP materialized view selection 
Thomas P. Nadeau, Toby J. Teorey 

November 2002 Proceedings of the 5th ACM international workshop on Data 
Warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(347.21 KB) Additional Information: full citation, abstract, references, citings, index terms

The goal of on-line analytical processing (OLAP) is to quickly answer queries from large 
amounts of data residing in a data warehouse. Materialized view selection is an
optimization problem encountered in OLAP systems. Published work on the problem of 
materialized view selection presents solutions scalable in the number of possible views. 
However, the number of possible views is exponential relative to the number of database 
dimensions. A truly scalable solution must be polynomial time relative ... 

Keywords: OLAP, OLAP performance, data warehouse, materialized views, view selection 



14 Query and view processing: Aggregate queries in peer-to-peer OLAP
Mauricio Minuto Espil, Alejandro A. Vaisman 

November 2004 Proceedings of the 7th ACM international workshop on Data 
warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(375.80 KB) Additional Information: full citation, abstract, references, index terms

A peer-to-peer (P2P) data management system consists essentially in a network of peer
systems, each maintaining full autonomy over its own data resources. Data exchange 
between peers occurs when one of them, in the role of a local peer, needs data available 
in other nodes, denoted the acquaintances of the local peer. No global schema is assumed 
to exist for any data under this computing paradigm. Henceforth, data provided by an 
acquaintance of a local peer must be adapted, in a manner that an ... 

Keywords: OLAP, P2P computing, data warehousing 

15 Online analytic processing (OLAP): Spreadsheets in RDBMS for OLAP
Andrew Witkowski, Srikanth Bellamkonda, Tolga Bozkaya, Gregory Dorman, Nathan Folkert, 
Abhinav Gupta, Lei Shen, Sankar Subramanian 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on
Management of data 

Publisher: ACM Press 

Full text available: pdf(182.22 KB) Additional Information: full citation, abstract, references, citings, index terms

One of the critical deficiencies of SQL is lack of support for n-dimensional array-based 
computations which are frequent in OLAP environments. Relational OLAP (ROLAP)
applications have to emulate them using joins, recently introduced SQL Window Functions
[18] and complex and inefficient CASE expressions. The designated place in SQL for 
specifying calculations is the SELECT clause, which is extremely limiting and forces the 
user to generate queries using nested views, subqueries and complex joins ... 

16 Constructing an OLAP cube from distributed XML data 

Tapio Niemi, Marko Niinimaki, Jyrki Nummenmaa, Peter Thanisch
November 2002 Proceedings of the 5th ACM international workshop on Data
Warehousing and OLAP
Publisher: ACM Press 

Full text available: pdf(200.01 KB) Additional Information: full citation, abstract, references, index terms

On-Line Analytical Processing (OLAP) is a powerful method for analysing large data 
warehouse data. Typically, the data for an OLAP database is collected from a set of data 
repositories such as e.g. operational databases. This data set is often huge, and it may 
not be known in advance what data is required and when to perform the desired data 
analysis tasks. Sometimes it may happen that some parts of the data are only needed 
occasionally. Therefore, keeping the OLAP database constantly up-to-date ... 

Keywords: OLAP, XML, distributed data warehousing 



17 OLAP: Evaluating XML-extended OLAP queries based on a physical algebra
Xuepeng Yin, Torben Bach Pedersen 

November 2004 Proceedings of the 7th ACM international workshop on Data
warehousing and OLAP 
Publisher: ACM Press 

Full text available: pdf(206.65 KB) Additional Information: full citation, abstract, references, index terms

In today's OLAP systems, integrating fast changing data, e.g., stock quotes, physically 
into a cube is complex and time consuming. The widespread use of XML makes it very 
possible that this data is available in XML format on the WWW. Thus, making XML data 
logically federated with OLAP systems is desirable. In this paper, we extend previous work 
on the logical federation of OLAP and XML data sources by presenting a simplified query 
semantics, a physical query algebra and a robust OLAP-XML quer ... 

Keywords: OLAP, XML, data integration, physical algebra, query semantics 

18 CubiST: a new algorithm for improving the performance of ad-hoc OLAP queries
Lixin Fu, Joachim Hammer 

November 2000 Proceedings of the 3rd ACM international workshop on Data 
warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(296.08 KB) Additional Information: full citation, references, citings, index terms



Keywords: OLAP, data cube, data warehouse, index structure, query optimization, query 
processing 



19 Query processing: Spatial hierarchy and OLAP-favored search in spatial data 
warehouse 

Fangyan Rao, Long Zhang, Xiu Lan Yu, Ying Li, Ying Chen 

November 2003 Proceedings of the 6th ACM international workshop on Data 
warehousing and OLAP
Publisher: ACM Press

Full text available: pdf(197.48 KB) Additional Information: full citation, abstract, references, index terms

Data warehouse and Online Analytical Processing (OLAP) play a key role in business
intelligent systems. With the increasing amount of spatial data stored in business 
database, how to utilize these spatial information to get insight into business data from 
the geo-spatial point of view is becoming an important issue of data warehouse and OLAP. 
However, traditional data warehouse and OLAP tools can not fully exploit spatial data in 
coordinates because multi-dimensional spatial data does not have im ... 

Keywords: spatial OLAP, spatial data warehouse, spatial hierarchy 



20 Online analytic processing (OLAP): QC-trees: an efficient summary structure for
semantic OLAP

Laks V. S. Lakshmanan, Jian Pei, Yan Zhao

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 



Recently, a technique called quotient cube was proposed as a summary structure for a 
data cube that preserves its semantics, with applications for online exploration and 
visualization. The authors showed that a quotient cube can be constructed very efficiently 
and it leads to a significant reduction in the cube size. While it is an interesting proposal, 
that paper leaves many issues unaddressed. Firstly, a direct representation of a quotient 
cube is not as compact as possible and thus still wast ... 

1 Simultaneous optimization and evaluation of multiple dimensional queries
Yihong Zhao, Prasad M. Deshpande, Jeffrey F. Naughton, Amit Shukla

June 1998 ACM SIGMOD Record, Proceedings of the 1998 ACM SIGMOD international
conference on Management of data SIGMOD '98, Volume 27 Issue 2

Publisher: ACM Press

Full text available: pdf(1.52 MB) Additional Information: full citation, abstract, references, citings, index terms



Database researchers have made significant progress on several research issues related 
to multidimensional data analysis, including the development of fast cubing algorithms, 
efficient schemes for creating and maintaining precomputed group-bys, and the design of 
efficient storage structures for multidimensional data. However, to date there has been 
little or no work on multidimensional query optimization. Recently, Microsoft has proposed 
"OLE DB for OLAP" as a standard multidime ... 

2 Heuristic optimization of OLAP queries in multidimensionally hierarchically clustered
databases

Dimitri Theodoratos, Aris Tsois 

November 2001 Proceedings of the 4th ACM international workshop on Data 
warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(1.44 MB) Additional Information: full citation, abstract, citings, index terms

On-line analytical processing (OLAP) is a technology that encompasses applications 
requiring a multidimensional and hierarchical view of data. OLAP applications often 
require fast response time to complex grouping/aggregation queries on enormous 
quantities of data. Commercial relational database management systems use mainly 
multiple one-dimensional indexes to process OLAP queries that restrict multiple 
dimensions. However, in many cases, multidimensional access methods outperform one- 
dimensiona ... 



Extending complex ad-hoc OLAP
Theodore Johnson, Damianos Chatziantoniou 

November 1999 Proceedings of the eighth international conference on Information 
and knowledge management 

Publisher: ACM Press 

Additional Information: full citation, abstract, references, citings, index terms

Large scale data analysis and mining activities require sophisticated information
extraction queries. Many queries require complex aggregation, and many of these 
aggregates are non-distributive. Conventional solutions to this problem involve defining 
User Defined Aggregate Functions (UDAFs). However, the use of UDAFs entails several 
problems. Defining a new UDAF can be a significant burden for the user, and optimizing 
queries involving UDAFs is difficult because of the "black box" ...

4 Optimizing multiple dimensional queries simultaneously in multidimensional
databases 

Weifa Liang, Maria E. Orlowska, Jeffrey X. Yu 

February 2000 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 8 Issue 3-4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(269.57 KB) Additional Information: full citation , abstract , citings , index terms 

Some significant progress related to multidimensional data analysis has been achieved in 
the past few years, including the design of fast algorithms for computing datacubes, 
selecting some precomputed group-bys to materialize, and designing efficient storage 
structures for multidimensional data. However, little work has been carried out on 
multidimensional query optimization issues. Particularly the response time (or evaluation 
cost) for answering several related dimensional queries simultaneous ... 

Keywords: Data warehousing, MDDBs, Multiple dimensional query optimization, OLAP, 
Query modeling 




5 Data warehousing and OLAP: Semantic-based delivery of OLAP summary tables in 
wireless environments 
Mohamed A. Sharaf, Panos K. Chrysanthis 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Publisher: ACM Press 

Full text available: pdf(251.10 KB) Additional Information: full citation, abstract, references, citings, index terms

With the rapid growth in mobile and wireless technologies and the availability, 
pervasiveness and cost effectiveness of wireless networks, mobile computers are quickly 
becoming the normal front-end devices for accessing enterprise data. In this paper, we 
are addressing the issue of efficient delivery of business decision support data in the form 
of summary tables to mobile clients equipped with OLAP front-end tools. Towards this, we 
propose a new on-demand scheduling algorithm, called SBS ... 

Keywords: broadcast pull, broadcast scheduling, mobile computing 



6 An overview of data warehousing and OLAP technology 
Surajit Chaudhuri, Umeshwar Dayal 
March 1997 ACM SIGMOD Record, Volume 26 Issue 1
Publisher: ACM Press

Full text available: pdf(101.60 KB) Additional Information: full citation, abstract, citings, index terms

Data warehousing and on-line analytical processing (OLAP) are essential elements of 
decision support, which has increasingly become a focus of the database industry. Many 
commercial products and services are now available, and all of the principal database 
management system vendors now have offerings in these areas. Decision support places 
some rather different requirements on database technology compared to traditional on- 
line transaction processing applications. This paper provides an overview ... 




http://portal.acm.org/results.cfm?coll=ACM&dl=ACM&CFID=69179521&CFTOKEN=477... 4/10/06



Results (page 1): olap multiple tables instnaces 



Page 3 of 6 



7 OLAP and statistical databases: similarities and differences 
Arie Shoshani 

May 1997 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on 
Principles of database systems 

Publisher: ACM Press 

Full text available: pdf(1.66 MB) Additional Information: full citation, references, citings, index terms




Online analytic processing (OLAP): Spreadsheets in RDBMS for OLAP
Andrew Witkowski, Srikanth Bellamkonda, Tolga Bozkaya, Gregory Dorman, Nathan Folkert, 
Abhinav Gupta, Lei Shen, Sankar Subramanian 
June 2003 Proceedings of the 2003 ACM SIGMOD international conference on
Management of data
Publisher: ACM Press

Full text available: pdf(182.22 KB) Additional Information: full citation, abstract, references, citings, index terms

One of the critical deficiencies of SQL is lack of support for n-dimensional array-based 
computations which are frequent in OLAP environments. Relational OLAP (ROLAP) 
applications have to emulate them using joins, recently introduced SQL Window Functions 
[18] and complex and inefficient CASE expressions. The designated place in SQL for 
specifying calculations is the SELECT clause, which is extremely limiting and forces the 
user to generate queries using nested views, subqueries and complex joins ... 

9 Industrial session: data warehousing and data mining: Bridging the gap between
OLAP and SQL

Jens-Peter Dittrich, Donald Kossmann, Alexander Kreutz 

August 2005 Proceedings of the 31st international conference on Very large data 
bases VLDB '05 

Publisher: VLDB Endowment 

Full text available: pdf(409.18 KB) Additional Information: full citation, abstract, references, index terms

In the last ten years, database vendors have invested heavily in order to extend their 
products with new features for decision support. Examples of functionality that has been 
added are top N [2], ranking [13, 7], spreadsheet computations [19], grouping sets [14], 
data cube [9], and moving sums [15] in order to name just a few. Unfortunately, many 
modern OLAP systems do not use that functionality or replicate a great deal of it in 
addition to other database-related functionality. In fact, the ga ... 

10 OLAP: Hierarchical dwarfs for the rollup cube
Yannis Sismanis, Antonios Deligiannakis, Yannis Kotidis, Nick Roussopoulos
November 2003 Proceedings of the 6th ACM international workshop on Data
warehousing and OLAP
Publisher: ACM Press

Full text available: pdf(100.29 KB) Additional Information: full citation, abstract, references, index terms

The data cube operator exemplifies two of the most important aspects of OLAP queries: 
aggregation and dimension hierarchies. In earlier work we presented Dwarf, a highly 
compressed and clustered structure for creating, storing and indexing data cubes. Dwarf 
is a complete architecture that supports queries and updates, while also including a 
tunable granularity parameter that controls the amount of materialization performed. 
However, it does not directly support dimension hierarchies. Rollup and ... 

Keywords: OLAP, aggregation, data cubes, dwarf cube, granularity, indexing, 
materialization, prefix elimination, structural redundancy, suffix coalescing, warehouses 

11 Research papers: OLAP: Privacy preserving OLAP
Rakesh Agrawal, Ramakrishnan Srikant, Dilys Thomas

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on
Management of data

Publisher: ACM Press

Full text available: pdf(357.72 KB) Additional Information: full citation, abstract, references

We present techniques for privacy-preserving computation of multidimensional 
aggregates on data partitioned across multiple clients. Data from different clients is 
perturbed (randomized) in order to preserve privacy before it is integrated at the server. 
We develop formal notions of privacy obtained from data perturbation and show that our 
perturbation provides guarantees against privacy breaches. We develop and analyze 
algorithms for reconstructing counts of subcubes over perturbed data. We als ... 

12 An array-based algorithm for simultaneous multidimensional aggregates 
Yihong Zhao, Prasad M. Deshpande, Jeffrey F. Naughton 

June 1997 ACM SIGMOD Record, Proceedings of the 1997 ACM SIGMOD international
conference on Management of data SIGMOD '97, Volume 26 Issue 2
Publisher: ACM Press

Full text available: pdf(1.45 MB) Additional Information: full citation, abstract, references, citings, index terms

Computing multiple related group-bys and aggregates is one of the core operations of On-Line
Analytical Processing (OLAP) applications. Recently, Gray et al. [GBLP95] proposed
the "Cube" operator, which computes group-by aggregations over all possible subsets of 
the specified dimensions. The rapid acceptance of the importance of this operator has led 
to a variant of the Cube being proposed for the SQL standard. Several efficient algorithms 
for Relational OLAP (ROLAP) have been d ... 

13 Searching for dependencies at multiple abstraction levels
Toon Calders, Raymond T. Ng, Jef Wijsen

September 2002 ACM Transactions on Database Systems (TODS), Volume 27 Issue 3
Publisher: ACM Press

Full text available: pdf(411.24 KB) Additional Information: full citation, abstract, references, citings, index terms

The notion of roll-up dependency (RUD) extends functional dependencies with 
generalization hierarchies. RUDs can be applied in OLAP and database design. The 
problem of discovering RUDs in large databases is at the center of this paper. An 
algorithm is provided that relies on a number of theoretical results. The algorithm has 
been implemented; results on two real-life datasets are given. The extension of functional 
dependency (FD) with roll-ups turns out to capture meaningful rules that are outsi ... 

Keywords: Data mining, functional dependencies, knowledge discovery, online analytical 
processing 



14 Advanced SQL modeling in RDBMS

Andrew Witkowski, Srikanth Bellamkonda, Tolga Bozkaya, Nathan Folkert, Abhinav Gupta,
John Haydu, Lei Sheng, Sankar Subramanian

March 2005 ACM Transactions on Database Systems (TODS), Volume 30 Issue 1

Publisher: ACM Press

Full text available: pdf(279.06 KB) Additional Information: full citation, abstract, references, index terms

Commercial relational database systems lack support for complex business modeling.
ANSI SQL cannot treat relations as multidimensional arrays and define multiple,
interrelated formulas over them, operations which are needed for business modeling. 
Relational OLAP (ROLAP) applications have to perform such tasks using joins, SQL 
Window Functions, complex CASE expressions, and the GROUP BY operator simulating the 
pivot operation. The designated place in SQL for calculations is the SELECT clause, whi ... 

Keywords: Excel, OLAP, analytic computations, spreadsheet 



15 OLAP: An analysis of additivity in OLAP systems 
John Horner, Il-Yeol Song, Peter P. Chen 

November 2004 Proceedings of the 7th ACM international workshop on Data 
warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(322.41 KB) Additional Information: full citation, abstract, references, index terms

Accurate summary data is of paramount concern in data warehouse systems; however,
there have been few attempts to completely characterize the ability to summarize 
measures. The sum operator is the typical aggregate operator for summarizing the large 
amount of data in these systems. We look to uncover and characterize potentially 
inaccurate summaries resulting from aggregating measures using the sum operator. We 
discuss the effect of classification hierarchies, and non-, semi-, and fully- addi ... 

Keywords: OLAP, additivity, data warehouse, metadata, summarization 



16 Research papers: OLAP: Efficient computation of multiple group by queries 
Zhimin Chen, Vivek Narasayya 

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: pdf(371.92 KB) Additional Information: full citation, abstract, references

Data analysts need to understand the quality of data in the warehouse. This is often done 
by issuing many Group By queries on the sets of columns of interest. Since the volume of 
data in these warehouses can be large, and tables in a data warehouse often contain 
many columns, this analysis typically requires executing a large number of Group By 
queries, which can be expensive. We show that the performance of today's database 
systems for such data analysis is inadequate. We also show that the pro ... 

17 Maintenance of data cubes and summary tables in a warehouse
Inderpal Singh Mumick, Dallan Quass, Barinderpal Singh Mumick

June 1997 ACM SIGMOD Record, Proceedings of the 1997 ACM SIGMOD international
conference on Management of data SIGMOD '97, Volume 26 Issue 2
Publisher: ACM Press

Full text available: pdf(1.58 MB) Additional Information: full citation, abstract, references, citings, index terms

Data warehouses contain large amounts of information, often collected from a variety of 
independent sources. Decision-support functions in a warehouse, such as on-line 
analytical processing (OLAP), involve hundreds of complex aggregate queries over large 
volumes of data. It is not feasible to compute these queries by scanning the data sets 
each time. Warehouse applications therefore build a large number of summary tables, or 
materialized aggregate views, to ... 



18 A case for dynamic view management 
Yannis Kotidis, Nick Roussopoulos 
December 2001 ACM Transactions on Database Systems (TODS), Volume 26 Issue 4
Publisher: ACM Press

Full text available: pdf(892.57 KB) Additional Information: full citation, abstract, references, citings, index terms

Materialized aggregate views represent a set of redundant entities in a data warehouse
that are frequently used to accelerate On-Line Analytical Processing (OLAP). Due to the
complex structure of the data warehouse and the different profiles of the users who
submit queries, there is need for tools that will automate and ease the view selection and 
management processes. In this article we present DynaMat, a system that manages 
dynamic collections of materialized aggregate views in a data warehous ... 

Keywords: Data cube, OLAP, data warehousing, materialized views 

19 Query processing: Spatial hierarchy and OLAP-favored search in spatial data
warehouse

Fangyan Rao, Long Zhang, Xiu Lan Yu, Ying Li, Ying Chen
November 2003 Proceedings of the 6th ACM international workshop on Data
warehousing and OLAP
Publisher: ACM Press

Full text available: pdf(197.48 KB) Additional Information: full citation, abstract, references, index terms

Data warehouse and Online Analytical Processing (OLAP) play a key role in business
intelligent systems. With the increasing amount of spatial data stored in business 
database, how to utilize these spatial information to get insight into business data from 
the geo-spatial point of view is becoming an important issue of data warehouse and OLAP. 
However, traditional data warehouse and OLAP tools can not fully exploit spatial data in 
coordinates because multi-dimensional spatial data does not have im ... 

Keywords: spatial OLAP, spatial data warehouse, spatial hierarchy 

20 OLAP: Evaluating XML-extended OLAP queries based on a physical algebra
Xuepeng Yin, Torben Bach Pedersen 

November 2004 Proceedings of the 7th ACM international workshop on Data 
warehousing and OLAP 

Publisher: ACM Press 

Full text available: pdf(206.65 KB) Additional Information: full citation, abstract, references, index terms

In today's OLAP systems, integrating fast changing data, e.g., stock quotes, physically 
into a cube is complex and time consuming. The widespread use of XML makes it very 
possible that this data is available in XML format on the WWW. Thus, making XML data 
logically federated with OLAP systems is desirable. In this paper, we extend previous work 
on the logical federation of OLAP and XML data sources by presenting a simplified query 
semantics, a physical query algebra and a robust OLAP-XML quer ... 

Keywords: OLAP, XML, data integration, physical algebra, query semantics 

Merging Models Based on Given Correspondences - Rachel Pottinger University (2003) (5 citations) (Correct)

....directions. The first involves showing that the Merge result, when applied to models and mappings that are 
templates for instances, has an appropriate interpretation on instances. This will demonstrate the usefulness 
of Merge in specific applications such as data integration and view integration [13, 20]. In some of our 
experiments we encountered a complex structure in one model that expressed a similar concept to a complex 
structure in another model, but there was no obvious mapping for the individual elements even though the 
structures as a whole were similar. An open question is how best to .... 

Motro, A. Superviews: Virtual Integration of Multiple Databases. Trans. on Soft. Eng., SE-13(7). 785-798.



Merging Models Based on Given Correspondences - Rachel Pottinger University (2003) (5 citations) (Correct)

....directions. The first involves showing that the Merge result, when applied to models and mappings that are 
templates for instances, has an appropriate interpretation on instances. This will demonstrate the usefulness 
of Merge in specific applications such as data integration and view integration [21, 31]. In some of our 
experiments, we encountered a complex structure in one model that is similar to a complex structure in another
model. But rather than relating the elements by a complex expression, we wanted to relate individual elements 
but only in the context of that complex structure. An open .... 

Motro, A. Superviews: Virtual integration of Multiple Databases. Trans. on Soft. Eng., SE-13(7). 785-798.



Heterogeneous Active Agents, I: Semantics - Eiter, Subrahmanian, Pick (1999) (3 citations) (Correct)

....Community. There is now a great deal of work in mediated systems techniques. In this paragraph, we merely 
explain the relationship between code call conditions and existing work on data and software integration. For
example, there have been several efforts to integrate multiple relational DBMSs [26, 72] and relational 
DBMSs, object oriented DBMSs and or file systems [36, 52, 89] However, to date, the semantics of 
mediators that take actions has not been explored. The work in this paper builds upon mediation efforts
reported upon in our HERMES effort described previously in [20, 61, 61, 94, 63] .... 

A. Motro, Superviews: Virtual Integration of Multiple Databases. IEEE Transactions on Software Engineering, 13 
(7):785--798, 1987. 



Query Answering in Inconsistent Databases - Bertossi, Chomicki (Correct)

....[44] 8.2 Data integration Assume we have a collection of (materialized) data sources S 1 , S n , and a global, 
virtual database G, that integrates data from S 1 , S n . According to the local as view approach, we can look 
at the data sources, S i , as views of the global schema G [59, 74, 65]. Now, given a query Q to G, one can 
generate a query plan that extracts the information from the sources [57, 58, 39] In the global as view approach, 
the global database is defined as a view over the data sources. Sometimes one assumes that certain ICs hold in 
the global system, and those ICs .... 

Motro, A. Superviews: Virtual Integration of Multiple Databases. IEEE Transactions on Software Engineering, 
1987, 13(7):785-798. 

....approach we use the Generic Integration Model GIM which enables an efficient algorithm to derive an
integrated schema in a user friendly data model. The data model GIM was firstly introduced in [SS95, SS96a] 
The extensional conflict as one main conflict is topic of many publications, e.g. DH84, Mot87, MNE88, 
Bra93, SGN93, TS93, KS95, NS96] They usually resolve this conflict directly in an object oriented model 
by using specialization. The original classes are often classes of the integrated schema enriched by new super 
subclasses and specialization relationships among them. DS96] for .... 

A. Motro. Superviews: Virtual Integration of Multiple Databases. IEEE Transactions on Software Engineering, 13 
(7):785--798, July 1987. 



View Integration : A Step Forward in Solving Structural Conflicts - And (1992) (62 citations) (Correct) 

....has to confirm, or deny, the correspondences proposed by the tool. We assume that knowledge of view 
correspondences is provided by users or the DBA to the view integration tool. That knowledge may be expressed 
using either declarative or procedural statements. The latter approach is found in [26] [27], in the context of 
database integration. Following Motro, the user specifies his global 10 schema by defining how it is built from 
the set of schemas of existing databases. The mapping specification uses a set of restructuring primitives (later 
called schema editing operations) This approach .... 

A. Motro, "Superviews: Virtual Integration of Multiple Databases", IEEE Transactions On Software Engineering, 
Vol. SE-13, n 7, July 1987, 785-798 



An Integration Method for Design Schemas - Mirbel, Cavarero (1996) (1 citation) (Correct) 

....the result presentation phase. After that, we indicate the strategies for using this process. In the last section, 
we conclude. 2 Existing Work A lot of different work has been done on schema integration. This goes from 
works on the tools to help the designer to get two schemas without conflicts [1], to work on the formal 
presentation of the integration rules [2] An overview of the variety of existing work can be found in [3] 
Generally, the integration process is divided in four phases. The first phase is the pre integration one. It 
consists in translating the schemas under study into a .... 

....These differences react on the structure chosen to represent the phenomenon, but also on the
terminology used to name it (homonymy and synonymy conflicts) The less recent approaches describe 
how to merge several schemas with regard to a set of correspondences between the schemas under 
study [1, 6, 4]. These approaches were based on relational, functional and entity relationship models. The 
interschema assertions had to be supplied by the designer, and the conflict solving depended on the designer s 
common sense. Some recent approaches, in most cases based on object oriented models, proposed .... 

A. Motro. Superviews: virtual integration of multiple databases. Transaction on software engineering, SE-13
(7):785-798, July 1987.

....using first order logic. Research in the area of schema integration has been carried out since the beginning of
the 1980s. A comprehensive survey of the area can be found in [Batini86] Most of the 3 work has been 
done in the context of the relational model [Biskup86] the functional model [Motro87], or (some extended 
version of) the ER model, Larson89] Spaccapietra92] Johannesson93a] Johannesson93b] 
Johannesson93c] Most work has focused on how to merge a number of schemas given a set of proposed 
schema correspondences. In recent years, however, several researchers have also shown .... 

A. Motro, "Superviews: Virtual Integration of Multiple Databases", IEEE Transactions on Software Engineering, 
vol. 13, no. 7, pp. 785-798, 1987. 



Theoretical Foundations of Schema Restructuring in Heterogeneous.. - Albert (2000) (5 citations) (Correct) 
....external databases were mapped to schemas in the common data model, and similar schema constructs from 



http://citeseer.ist.psu.edu/context/7296870 



4/10/06 



Citations: Superviews: Virtual Integration of Multiple Databases - Motro (ResearchIndex) Page 3 of 6



different schemas were integrated into generalization hierarchies. Subsequently, many other systems were
proposed or implemented with integration based on an outerjoin or generalization mechanism [1, 10, 11,
12, 13, 23, 31, 36, 33], as well as systems which allowed for more general ways in which objects may be
related or merged [5, 15, 24, 37, 42] The systems and techniques just referenced all provide support for 
what was referred to as data integration in the previous section. Data integration is the process of 
merging .... 

virtual multi database containing homonyms, synonyms and redundant data possibly organised in several

different ways. There is, however, a transformation which maps an instance of this multi database to an 
instance of our warehouse schema so that the warehouse turns out to be a view (or superview) [9, 11] in 
the most general sense. Even if we assume a copy semantics, i.e. data in operative databases will be time 
stamped and added to the existing data in the view, this will not cause a significant change. The basic assumption 
of data warehouses being organised as separate databases then turns .... 

A. Motro. Superviews: Virtual integration of multiple databases. IEEE ToSE, 13(7), 1987. 

the most general sense. Even if we assume a copy semantics, i.e. data in operative databases will be time
stamped and added to the existing data in the view, this will not cause a significant change. The basic assumption
of data warehouses being organized as separate databases then turns into .... 

A. Motro. Superviews: Virtual integration of multiple databases. IEEE ToSE, 13(7), 1987. 

....Information units depend on the database schema. They represent data in a standard, intuitive framework that
allow high performance access. Information units modelling can be compared with the modelling of 
semistructured data. Then information units turn out to be generalized views on the database [3, 14]. The 
generalization should support data condensation and supplementary facilities to enable an adequate 
representation to the user. We restrict the rule system used for generating units from the database to the smallest 
possible system. The rule system can be extended by inclusion of different ....

A. Motro, Superviews: Virtual integration of multiple databases. IEEE ToSE, 13, 7, July, 1987. 

....federated database systems (FDBS) integration is a function regularly performed at different levels and by
different services depending on the organization of the FDBS environment. The value of the problem is well 
known, various approaches, techniques and sometimes tools were proposed (see, eg, [7, 22, 35, 36, 30, 
29, 26] and surveys [2, 27] In spite of the diversity of approaches, several common points can be well 
identified. Integration consists of schema integration composing a global schema from the set of local ones, and 
data integration computing virtual extension of the global schema. Schema .... 

A. Motro. Superviews: Virtual integration of multiple databases. IEEE TOSE, 13(7):785-798, 1987.

....Schema integration is a main component of conceptual design which is itself a part of the overall activity of
database design. This explains the significant interest in schema integration methodologies: a vast
diversity of various approaches, techniques and sometimes tools were proposed (see, eg, [10, 30, 42, 43,
40, 39, 36] and surveys [4, 37, 33] Moreover, to date the value of the issue has increased greatly due to the 
evident tendency of organizing modern (and of the nearest future) information systems on cooperative 
federal principals. Indeed, in the context of federated database systems (FDBS) integration is .... 

A. Motro. Superviews: Virtual integration of multiple databases. IEEE TOSE, 13(7):785-798, 1987. 

....show the interest of separating the conflict declaration and the conflict resolution. Finally, the conclusion
analyzes possible extensions to our work. 3 2 Schema Comparison Procedural integration methodologies 
propose to successively modify input schemas, by using a set of operators [4] 10] [15], 24] to remove 
discrepancies and put them in conformity [1] No semantic verification can be performed and erroneous 
manipulations cannot be detected by the system. Moreover, this approach leaves to the user the responsibility 
of solving conflicts. On the contrary, assertion based methodologies .... 

A semantic conflict between two related classes 1 occurs when the corresponding sets of real world objects 

are not exactly the same. This kind of conflict has been extensively dealt with in the literature and is 
usually solved with the use of the generalization (Is A) concept [4] 14] [15]. More recently, the multi 
instanciation concept or May Be A link [17] adds a new perspective. Moreover, the administrator can choose how 
to use this concept, among all possibilities, to solve semantic conflicts. A structural conflict occurs when related 
real world items are modeled with .... 

Motro A., "Superviews: Virtual Integration of Multiple Databases", IEEE Transaction on Software Engineering, Vol.
SE-13, n 7, July 1987, 785-798



A Basis for Interactive Schema Merging - Buneman, Davidson, Kosky.. (1992) (7 citations) (Correct)

....perform some sort of translation from potentially different underlying database models to a common 
intermediary model. Various approaches to schema merging have been proposed; see [2] for a survey. These 
vary from sets of tools for manipulating two schemas into some form of consistency ([3, 4]) to algorithms 
which take two schemas together with some constraints and produce a merged schema. In practice a 
method that lies somewhere between these two extremes is usually desirable: a certain amount of user 
intervention is inevitable, though, once any conflicts have been resolved and the .... 

....says that arrows are, in some sense, preserved by specialization, so that, if p is a specialization of q and q has 
an a arrow of class s, then p also has an a arrow with class at least as specific as s. These axioms are 
equivalent to those given for functional schemas in [4] and also in [3] (though the latter used unlabeled 
arrows). Let us consider the ER diagram of a University Business Office database shown in figure 3. The 
diagram indicates that there are two ways of classifying University Employees (UE) as research employees (Res 
UE) or as academic employees (Ac UE). Teaching .... 
....[24] this approach developed integration rules for a variety of data models. A generic description of schema 
correspondences among different data models was provided. Other approaches defined a set of operators to 
build a virtual integration of multiple databases or to customize virtual classes [19], [20], [22]. Yet another 
approach asserted that the different constructs of component schemas be standardized before the 
integration. Several transformation rules were then proposed for the view integration process [12]. In this paper, 
we present a schema integration mechanism to achieve a .... 

....generic data model. Besides, we consider the construction of class hierarchies in the integrated schema, 
which was not discussed in [24]. In the process of schema integration, we use integration operators to restructure 
and integrate the component schemas. Different from the operators provided in [19] and [22], our class 
restructuring operators can be used to restructure the attributes and class hierarchies of a class. Thus, 
the conflicts in component schemas can be resolved before the integration. The class integration operators can 
be used to integrate the information in two classes or a set .... 
....under Contract No. NSC 83 0408 E 007 030. .... was provided. Other approaches defined a set of operators to 
build a virtual integration of multiple databases or to customize virtual classes [18], [23], [24]. Yet another 
approach asserted that the different constructs of component schemas be standardized before the 
integration. Several transformation rules were then proposed for the view integration process [13]. In this paper, 
we present a schema integration mechanism to achieve a .... 

....generic data model. Besides, we consider the construction of class hierarchies in the integrated schema, 
which was not discussed in [20]. In the process of schema integration, we use integration operators to restructure 
and integrate the component schemas. Different from the operators provided in [18] and [24], our class 
restructuring operators can be used to restructure the attributes and class hierarchies of a class. Thus, 
the conflicts in component schemas can be resolved before the integration. The class integration operators can 
be used to integrate the information in two classes or a set .... 
....that will allow different database systems to exchange information. For the most part, this research has been 
either in the context of view integration as part of the database design process (e.g. [4, 9, 12]) or in the 
context of systems for virtual merging of independent databases (e.g. [13, 14, 7]). For overviews of this 
area, see [2, 5]. Compared to this focus on intensional inconsistency, the equally challenging problem of 
extensional inconsistency has received much less attention. Extensional inconsistencies surface only after 
all intensional inconsistencies have been resolved, at a .... 

....3. A collection $(D_1, C_1, d_1), \ldots, (D_n, C_n, d_n)$ of databases. 4. A collection $(D, D_1), \ldots, (D, D_n)$ of scheme 
mappings. 5. A collection $(C, C_1), \ldots, (C, C_n)$ of constraint mappings. This definition may be considered a 
formalization of virtual databases defined in [7]. Scheme mapping may be considered an abstraction of the 
different solutions that have been advanced to the task of relating global schemes to individual schemes (e.g. [7, 
1, 13]). Note that the mappings from $D$ and $C$ to the individual databases are not necessarily total; i.e. not all 
views .... 
....from a collection of distributed, heterogeneous and overlapping databases. A standard approach to this 
problem has been to integrate the independent databases by means of a comprehensive global scheme 
that models the information contained in the entire collection of databases (for example, [23, 38, 7, 30]). 
This global scheme is fitted with a mapping that defines the elements of the global scheme in terms of 
elements of the schemes of the member databases. Algorithms are designed to interpret queries on the global 
scheme. Such global queries are translated (using the information captured in the .... 
....in one database with equivalent views (or satisfaction of equivalent constraints) in another database. In a 
multidatabase, the former database is the global database, and the latter is a member database. This definition 
may be considered a formalization of virtual databases defined in [30]. Scheme mapping may be considered 
an abstraction of different solutions that have been advanced to the task of relating global schemes to schemes of 
member databases (e.g. [23, 38, 7, 30]). 3.5 Integrability Assumptions The purpose of multidatabases is to 
integrate information from .... 